Axiomatic geometrical optics, Abraham-Minkowski controversy, 
and photon properties derived classically 
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By restating geometrical optics within the field-theoretical approach, the classical concept of a 
photon (and, more generally, any elementary excitation) in arbitrary dispersive medium is intro- 
duced, and photon properties are calculated unambiguously. In particular, the canonical and kinetic 
momenta carried by a photon, as well as the two corresponding energy-momentum tensors of a wave, 
are derived from first principles of Lagrangian mechanics. As an example application of this formal- 
ism, the Abraham-Minkowski controversy pertaining to the definitions of these quantities is resolved 
for linear waves of arbitrary nature, and corrections to the traditional formulas for the photon ki- 
netic energy-momentum are found. Several other applications of axiomatic geometrical optics to 
electromagnetic waves are also presented. 
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I. INTRODUCTION 



A. Motivation 



The discussion about how to define the momentum and 
the angular momentum of a photon in dispersive medium 
(PDM), and even simply of a classical wave, recurs in lit- 
erature periodically during the last hundred years. The 
recent burst of theoretical [1-53] and experimental [54- 
56] publications indicates both an abiding interest in the 
problem and, apparently, a lack of consensus or certainty 
about what the correct answer is. The traditional argu- 
ments can be found in reviews like Refs. [57-63] and ref- 
erences therein, too numerous to be listed in this paper. 
Let us mention only briefly that two alternative forms of 
the PDM momentum are adopted most commonly: 



p M = hiorip/c, p A = hu>/(n g c), 



(1) 



which are known, respectively, as the Minkowski inter- 
pretation and the Abraham interpretation [64]. (Here ui 
is the frequency, c is the speed of light, and n p = c/v p 
and n g — c/v g are the refraction indexes associated with, 
correspondingly, the phase velocity v p and the group ve- 
locity w g ; for the two associated angular momenta see 
Ref. [46].) Since both have supporting theoretical and 
experimental evidence [1], the question about which of 
the two interpretations is "more correct" has been con- 
troversial. 

A resolution to this Abraham-Minkowski controversy 
(AMC) was proposed recently in Ref. [1]. It was argued 
there that both interpretations are correct; namely, pyi 
can be attributed as the canonical momentum, and pa 
can be attributed as the kinetic momentum of a pho- 
ton. Yet, strictly speaking, the argument of Ref. [1] ap- 
plies only to the case of a nonrelativistic solid dielec- 
tric. The subsequent generalization in Ref. [6] is not 
quite complete either; for example, the latter neglects 
electrostriction and magnetostriction, kinetic effects, and 
spatial dispersion, and also attributes v g entirely to the 



Poynting flux, in disagreement with a textbook theorem 
[Eq. (136)]. Thus, a quantitative relativistic theory is 
still lacking that would correct the existing understand- 
ing of PDM, and Eqs. (1) in particular. The purpose 
of this paper is to resolve these issues in a consistent 
manner and, through that, formulate a comprehensive 
asymptotic theory of linear waves of arbitrary nature. 



B. Field-theoretical approach 

Before photon properties can be calculated, the PDM 
itself must be defined unambiguously. (In particular this 
means that, contrary to the common presumption, the 
PDM properties cannot simply be inferred from experi- 
ment without a theory of what is being inferred.) Second 
of all, the definition should not be expected to originate 
from electromagnetism, because the concept of a pho- 
ton, and even of v p and v g that enter Eqs. (1), is not 
embedded in Maxwell's equations per se. On the other 
hand, the photon concept is neither entirely of quantum 
nature [65] , and mechanical properties of quantum radi- 
ation (dipole force, radiation pressure, cooling effects on 
atoms, etc) are consistently shown to have direct classical 
analogs [66-70] . It then stands to reason that an abstract 
classical calculation could resolve the AMC, generalizing 
Eqs. (1), without assuming a specific underlying physical 
system whatsoever. 

To understand what the right framework is for such 
a calculation, notice that introducing a photon implies 
that the frequency w and the wave vector k are well de- 
fined. These are exactly the validity conditions of the 
asymptotic theory commonly known as geometrical op- 
tics (GO). (The term "optics" here means only that the 
theory deals with sufficiently large w and fc; i.e., waves 
need not be electromagnetic.) Although usually defined 
through rays and wave equations [71-79], the most fun- 
damental, axiomatic GO is an abstract field theory that 
applies to any field having a Lagrangian density of a spe- 
cific form [Eq. (11); dissipative effects can also be added, 



Sec. IV D]. Just like Newton's laws of particle motion 
hold, with obvious exceptions, independently of specific 
forces acting on particles, the basic GO equations are 
then invariant to the wave nature [80] , and the wave prop- 
erties can be derived in general. Hence, axiomatic GO 
should resolve the AMC automatically and transparently. 



C. Outline 

Here we aim to apply the GO formalism toward de- 
riving PDM general properties deductively using noth- 
ing more than first principles of classical mechanics. In 
doing so, we draw on the Lagrangian field theory as elab- 
orated in plasma physics and hydrodynamics during the 
last fifty years [81-109]. Since this literature, sadly, re- 
mains unknown within the mainstream approach to the 
AMC (with few exceptions), the general formalism of ax- 
iomatic GO will also be restated. 

Specifically, below we do the following: 

(a) formulate from first principles a comprehensive the- 
ory of axiomatic GO, extending and expanding on 
existing results in applications to waves of arbitrary 
nature (not just electromagnetic waves); 

(b) explain how the wave canonical energy-momentum 
tensor (EMT) is related to the photon properties 
in the Minkowski interpretation (here and further 
photons arc understood as any elementary excita- 
tions, not just light quanta; Sees. Ill A and VI F); 

(c) introduce the wave angular momentum and pho- 
ton (plasmon, phonon, polariton, etc.) spin within 
axiomatic GO and calculate it explicitly for cylin- 
drical beams, also of arbitrary nature; 

(d) derive the effect of local linear dissipation; 

(e) unambiguously define the wave kinetic EMT and 
calculate it explicitly for isotropic relativistic fluids 
(with striction effects included); 

(f) calculate the associated energy, momentum, and 
angular momentum per photon (plasmon, phonon, 
polariton, etc.); show that the traditional, Abra- 
ham's formulas are reproduced as a limiting case; 

(g) illustrate how the properties of electromagnetic 
waves, considered as a special case, can be inferred 
deductively within axiomatic GO; 

(h) finally, responding to the questions posed in 
Ref. [35], we clarify the applicability of Minkowski's 
and Abraham's formulas for electromagnetic waves 
in various media (including cold, warm, and rela- 
tivistic) and present examples. 

Note that, in parts (a) and (b), which correspond to 
Sees. III-IVB, we mostly repeat known arguments, pub- 
lished previously, e.g., in Refs. [84-88]. Also keep in mind 



that, in application to specific media, the problem of find- 
ing both canonical and kinetic EMT of a classical electro- 
magnetic wave was solved comprehensively in Ref. [88], 
which, while known within the plasma physics commu- 
nity, seems to remain unknown to the general readership. 
The difference between Ref. [88] and our paper is that 
we use different machinery to arrive at results that are, 
in certain aspects, more general and, as a consequence, 
more concise and transparent. In particular, our paper is 
not about electromagnetism but rather about basic physics 
of waves, so we need not specify the wave nature; also we 
derive photon properties, and allow for dissipation, com- 
plementing Ref. [88] on these issues. 

The paper is organized as follows. In Sec. II we intro- 
duce the notation used throughout the text. In Sec. Ill 
we describe general GO waves, including nonlinear waves, 
in arbitrarily curved spacetime and also in the Minkowski 
spacetime as a particular case. In Sec. IV we reduce 
the theory further to describe linear waves and explain 
how the Minkowski representation is recovered; in par- 
ticular, the wave angular momentum and dissipative ef- 
fects are discussed. In Sec. V we introduce the wave 
kinetic EMT and reproduce the traditional formulas for 
the corresponding photon quantities as a limiting case. In 
Sec. VI we consider, as an example, how the specific prop- 
erties of electromagnetic waves flow deductively from the 
general theory. Sec. VII explores ramifications of our 
findings and summarizes our main results. Auxiliary cal- 
culations are presented in Appendix. 



II. NOTATION 

The following notation will be assumed below. We 
use the symbol = for definitions. Greek indexes span 
from to 3 and refer to coordinates in spacetime, x a . In 
particular, for the Minkowski spacetime we adopt x° = 
ct, where t is time. Hence the Lorentz transformation 
matrix, A"^ = dx a /dx'P , is given by 

A° = 7, A°» = JVi/c, A* = iv l /c, 



A' 



s'i + ij-iyvj/v 2 



(2) 



where v l is the velocity of the "primed" reference frame 
with respect to the laboratory frame, and 7 = (1 — 
v 2 /c 2 ) -1 / 2 . Latin indexes i, j, and I span from 1 to 3 
and refer to spatial coordinates, x l . Spatial vectors are 
denoted with bold, X; spatial tensors are also marked 
with hat, T; symbols like XY = Z stand for spatial 
dyadics, Z % i = X % Y^\ the symbol 1 denotes the unit spa- 
tial tensor; besides, the three-tensor 



7-1 



(3) 



is the spatial part of h. a ^. Summation over repeating 
indexes will also be implied; e.g., X l Yi = J2i=i X l Yi. 

Latin indexes (excluding i, j, I, and non-bold roman, 
as in n p ) denote partial derivatives with respect to the 



corresponding variables; e.g., for / = f(a,ui, k; i, x), the 
symbol / x denotes the derivative (gradient) with respect 
to the last argument, x. In addition to those, we also 
introduce "full" temporal and spatial derivatives, dtX = 
dX/dt and diX = dX/dx l , which treat all arguments of 
(any) X as functions of, correspondingly, t and x 1 . For 
instance, for the above /, one has 

dtf = fa d t a + f u d t iu + f ki d t h + ft, (4) 

dif = f a did + fu diuj + f kj dikj + f Xi , (5) 

while dt<x(t, x) = at(£,x), etc. The symbol V denotes 
the associated full covariant derivative; e.g., V if = dif 
is the full gradient of the scalar /, and V • F is the full 
divergence of the vector F, 



V - F= ^^)< 



(6) 



where r\ = det^-, and rjij — rjji is the spatial metric. [In 
Cartesian coordinates, Euclidean space has rfo = if 3 = 
diag (1, 1, 1), so r\ — 1.] The symbol >a denotes the analo- 
gous (to di) full derivative with respect to x a , and - a de- 
notes the analogous full covariant derivative. For exam- 
ple, the four-divergence is 



'*--^sf<^ 



(T) 



where g = — det<? M „, and g^ v — g„ M is the spacetime 
metric. For introduction to the tensor notation and index 
manipulation rules in particular, see Refs. [71, 110, 111]. 
Some specific symbols are also summarized in Table I, 
and the abbreviations used in the text are as follows: 

ACT action conservation theorem, 

AMC Abraham-Minkowski controversy, 

EMT energy-momentum tensor, 

GO - geometrical optics, 

PDM photon in dispersive medium, 

SAM spin angular momentum, 

WMS "wave + medium" system. 



III. GENERAL WAVES 

A. Covariant formulation 

First, let us consider a general nondissipative wave de- 
scribed by some action integral S = J Q^/gd^x, where 



^fgd x = yfgdx dx dx dx 



(8) 



is an invariant volume element in spacetime, and the four- 
scalar £ is the Lagrangian density. Since the action of 
the underlying medium is not included here, no invari- 
ance requirements on £ are imposed. Instead, we assume 
that the wave structure remains hxed (albeit not neces- 
sarily sinusoidal), so the wave is fully described by some 



canonical phase 9, which will be understood as a scalar 
field 9(x l/ ), and a — a{x v ), which is an arbitrary measure 
of the wave local amplitude [112]. We also assume that 
the envelope evolves on spacetime scales large compared 
to those of local oscillations. On such time scales, it is 
only the average Lagrangian density that contributes to 
S, so one can adopt that £ does not depend on 9 explic- 
itly. Instead, £ must depend on the phase four-gradient, 



fcp = 



(9) 



which is the generalized "wave vector" (actually, a four- 
covector here), obviously having zero four-curl, 



Ki, 



e , 



e. 



o. 



(10) 



[Equation (10) is known as the consistency relation.] Be- 
sides that, £ must depend on a; yet the dependence on 
the amplitude gradients a „ is negligible in the GO limit. 
Thus, allowing also for slow parametric dependence on 
the spacetime coordinates x v , we postulate 



£ = £(a, kf, 



(11) 



which as well can be considered as the definition of the 
GO approximation. Hence wave equations are inferred 
using the least action principle, namely, as follows. 

First, let us consider variation of S with respect to the 
wave amplitude a. Since S a S — J £ a 5a y/g d*x for any 
5a, the requirement 5 a S = leads to 

£a = 0. (12) 

Equation (12) can be understood as the wave dispersion 
relation, and it is generally nonlinear, i.e., may retain 
essential dependence on a (see, e.g., Refs. [106, 108]). 

Second, let us consider variation of S with respect to 
the wave phase 9 [113]. Due to Eq. (9) and the fact that 
£ does not depend on 9 explicitly, for any 59 one has 

5 9 S = f£ k J9^^gd i x 

= I [(V9&k, 66)0 - (^gZ ktl ),^5e]d 4 x 

= -J(£ k J^59^gd i x, 

where we used the fact that the wave field vanishes at 
infinity, so f(. . .) j(U d A x = 0. Thus, the requirement 
5qS = yields that the four-divergence of the action 
flux density J 11 = — Z k is zero [114], 



J^-.u = 0, 



(13) 



which is called the action conservation theorem (ACT). 
Since the ACT has the form of a continuity equation, one 
can treat Q^ = J^ /h as the flux density of some ficti- 
tious quasiparticles, or "photons" . (In application to spe- 
cific waves, one can as well think of plasmons, phonons, 
polaritons, or any other elementary excitations instead. 
See also Sec. VI F.) However, remember that, within our 
classical description, it is only the product HQ^ that has 



TABLE I: Summarized here is some notation adopted for wave variables ("ponder." stands for ponderomotive; the integral 



quantities are obtained by integrating the corresponding densities over the spatial volume, dV = y^rf' 
notation is explained in Sec. II and throughout the text. 
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an explicit physical meaning, so the actual value of H will 
be irrelevant for our purposes. 

Finally, let us also introduce the wave EMT as follows. 
Consider the (generally asymmetric) tensor 



Tj = k a j' 3 + %£. (14) 

The divergence of Tj equals 

Tj.0 = k ai pj p + k a J p ifi + 5'l{Z a a,? + Z kx k x , fi + Z xP ) 
= k a: pj fj + 5i{£ kx k X ;fS + £ x e) 

= ka\fiU K\\aU T ^x a 

= ka\fiU k a ;\U + "i-rr" 

= £x-, (15) 



where wc used Eqs. (11)-(13). This tensor is then associ- 
ated with the conservation law, T a -p = 0, yielded when 
the system is translationally invariant in spacetime (i.e., 
when the four-force is zero, £ 2 c = 0). Hence, T a is a 
true canonical EMT [115] , as one could also infer from the 
standard definition that is based on Noether's theorem 
[116]. However, notice that, in contrast with the funda- 
mental theorem of the vacuum field theory [71, Sec. 32], 
T a/3 does not permit the usual [88, 101, 117-120] sym- 
mctrization, since £ is not restricted by any invariance 
requirements [81]. (Yet see Ref. [121] for symmetrization 
via adopting an effective, "optical" metric.) In particu- 
lar the very fact that a scalar field such as 0(x v ) yields 
an asymmetric EMT already proves the lack of Lorcntz 
invariance [122, Sec. 5.6]. 



B. Application to the Minkowski spacetime 

From now on, we will assume the Minkowski spacetime 
with metric signature (— , +, +, +); hence, 



.9oo = g 



00 



-1, 



Vij = 9ij, V = 9- 



(16) 



(Although the space is Euclidean, we will allow for curvi- 
linear coordinates; thus, albeit flat, the spatial metric rjij 
can otherwise be arbitrary.) In this case, k a = (— w/c, k), 
and k a = (u)/c, k), where 

u = -d t 6, k = V6. (17) 

Then Eq. (10) turns into the following set of equations: 

9 t k + Vcj = 0, Vxk = 0. (18) 

One may notice also that the latter equation here can 
be considered as the initial condition for the former one, 
taking curl of which readily yields 9j(V x k) = 0. 
Accordingly, Eq. (11) becomes 



£ = £(a, w,k;t, x). 



(19) 



The dispersion relation hence holds in the form (12). The 
ACT can be rederived from Eq. (19) or it can be deduced 
from Eq. (13) by substituting J a = (cX, JT); either way, 
one gets (cf. Rcfs. [85, 86]) 



d t l + V • J = 0, 



(20) 



where I is the action density, and 3 is the action spatial 
flux density, introduced as follows: 



I = SL 



J = -£ k . 



(21) 



In particular, integration of Eq. (20) over the volume 
dV = ^/rjd 3 x yields conservation of the integral action, 



1=1 XdV = const. 



(22) 



Introducing the photon density Af = I/h and the photon 
spatial flux density Q = J /h, one can further rewrite 
Eq. (20) as d t M + V • Q = 0, and Eq. (22) will yield 
the photon conservation, N = J Af dV — const. Also 
notice that both / and N are Lorentz invariants, as well- 
known to flow from the general (unlike, e.g., in Ref. [123]) 
properties of the continuity equation [111, Sec. 2.6]. 
The elements of the (contravariant) EMT are now 



T 00 = uj! - S, T°* = uj*/c, 
T° = ck l l, T ij = k l J' J + n ij S. 



In particular, Eq. (15) yields 



dt 



J__d_ 
^Jr\ dx l 



(cT 0l Vv) = w, 



(23) 



(24) 



which is a continuity equation for T 00 with the right- 
hand side being w = g 00 c£, x o — —Sit- Since the latter has 
the meaning of the canonical power source, £ = T 00 must 
be the wave canonical energy density, and Q l = cT 0t 
must be the canonical energy flux density. Similarly, 



1 dT i0 Id 



(25) 



which is a continuity equation for the three- vector T t0 /c 
with the right-hand side being f = £ x . Since the lat- 
ter has the meaning of the canonical momentum source, 
■pi = 7"«y c mus ^ De |; ne wav e canonical momentum den- 
sity, and the (generally asymmetric) three-tensor IT- 7 = 
T 13 must be the canonical momentum flux density [124]. 
In summary, one then has 



j-a0 _ 




(26) 



where the individual blocks are given by 

£ = ool-S, Q = ujJ, 
V = kl, Il = kJ + Si, (27) 

and Eqs. (24) and (25) can be written as follows: 



d t £ + V-Q = w, d t V + V • n = f . 



(28) 



It is hence seen that the wave energy propagates at veloc- 
ity 0,1 £ that is generally different from the action flow 
velocity 3 ' /I [cf. Eq. (20)], and similarly for the mo- 
mentum flow velocity. Moreover, those three turn out 
to be different from the velocities of information, or the 
nonlinear group velocities, of which there can also be 
more than one. For an expanded discussion on this see 
Rcfs. [86, 1091 and references therein. 



IV. LINEAR WAVES: MINKOWSKI 
REPRESENTATION 

A. Basic equations 

Now let us consider a linear wave, i.e., such that has 
w(k;t,x) independent of a. In this case, from Eq. (12) 
it is seen that S a must be separable as S a — S(o;,k) J 4 a , 
where A(a, u>, k) is some function such that A a is nonzero. 
[Parametric dependence of functions like S, 25, and A on 
(£, x) is also implied but will be omitted for the sake of 
brevity] Then, 



£ = !D(w,k)A 



(29) 



It will hence be convenient to think of a as of a linear 
measure of the oscillating field amplitude. Then, most 
commonly, one will have A ex a 2 ; yet for our purposes 
the actual dependence need not be specified. 
Equation (12) now yields 

2)(w,k) = 0. (30) 

Thus Eqs. (21) become 

Z = 2> U i4, J = -V k A, (31) 

and Eqs. (27) take the form 

£ = ul, Q = ujJ, V = kl, Il = kJ. (32) 

Hence the photon canonical energy, H = £/Af, and the 
photon canonical momentum, P = "P/JV, equal [84] 



H = Hlu, 



fik, 



(33) 



matching the Minkowski interpretation exactly and inde- 
pendently of the wave nature. (In fact, P = Kk holds 
even for nonlinear waves [cf. Eqs. (27)], albeit assuming 
a fixed ratio of a and the amplitude of each nonncgligiblc 
harmonic.) In particular, P a = (H/c, P) = hk a happens 
to be a true four- vector, by definition of k a , so P a P a is a 
Lorentz invariant. The latter can also be understood as 
a measure of the photon canonical mass 2H, defined via 



OJl 2 = -P a P a /c 



(34) 



(cf., e.g., Refs. [103, 125, 126]). 

Further, differentiating Eq. (30) with respect to k [with 
u = w(k; t, x)] also gives £) w v g + £>k = 0, where we in- 
troduced the linear group velocity v g = Wk; therefore, 



v g = -S k /S w = JIT. 
Hence, Eq. (26) yields T a/3 = AfT " 3 , where 



rpa/3 



hio fiwVg/c 



chk ftkv 



(35) 



(36) 



- 



is the canonical EMT per photon. Alternatively, one can 
also exclude Af and rewrite Eqs. (32) as 



V = k£/cu, Q = £v g , n = Pv g 



(37) 



It is seen, from here and Eqs. (28), that the canonical 
action, energy, and momentum are all transported at the 
same velocity, v g . However, keep in mind that the full, 
or "kinetic" energy and momentum densities carried by 
the wave (Sec. V) generally do not have this property. 

Finally, let us introduce photon trajectories, (i(X = v g , 
also known as GO rays. Along those trajectories, 



d t = d t + v g • V. 



Then Eqs. (18) yield 
d t x = v s , 



d t k 



d t ui = LU t - 



(38) 



(39) 



[Remember that the derivatives w x and uj t of w(k;i, x) 
are taken at fixed k.] In particular, the ACT can hence 
be written as 



d t In I = -V • • 



(40) 



Also notice that Eqs. (39) can be understood as canonical 
equations for the photon motion governed by the Hamil- 
tonian i/(x, P;t). In this form, i.e., 



d t x = H P , d t P 



H, 



d t H = H t 



(41) 



they are identical to the motion of a true classical particle 
such as an electron, which supports the well-known anal- 
ogy between GO and classical mechanics [127, Sec. 9.8]. 
Reverting to Eqs. (11) and (19), it is seen then that not 
just waves, but classical particles too can be described in 
terms of phases and amplitudes [128]. 



B. Noether's integrals 



Various transport equations can now be derived from 



d t (XI) + V 
= {d t X)l + 
= (d t X)l + 
= l{d t X + 
= Zd t X, 



(XJ0 = 
x(d t i) + 

(VX)J 
v g ■ VX) 



{VX)J + X(V • J) 



(42) 



which holds for arbitrary X. Some of those are as follows. 
Action. - Taking X equal to a constant, one recov- 
ers Eq. (20), or the ACT. [Of course, this is not an in- 
dependent derivation of the ACT, since the latter itself 
was used in deriving Eq. (42).] As already emphasized, 
Eq. (20) is due to the fact that £ does not depend on 9 ex- 
plicitly. Since it also implies conservation of the integral 
action /, the latter can be understood as the correspond- 
ing Noether's integral. 



Energy. — Taking X = u, one obtains 
d t £ + V- (£v g ) =ld t uj. 



(43) 



As seen from Eq. (39), in stationary medium dtW — 0, so 
one recovers the result obtained in Sec. Ill, namely, that 
the wave integral energy, j£dV, is the Noether's inte- 
gral that is conserved when the system is translationally 
invariant in time. Another corollary, which is obtained 
by comparing Eq. (43) with Eq. (24), is that 



I dt<jJ = Icot, 



(44) 



where we also used Eq. (39). Alternatively, one can 
rewrite this as w = M d t H, where dtH is the work on 
an individual photon per unit time. 

Momentum. — Taking X = k, one obtains 



d t V + V-{Vv s )=ld t k. 



(45) 



As seen from Eq. (39), in homogeneous medium d t k = 0, 
so one recovers the result obtained in Sec. Ill, namely, 
that the wave integral momentum, J "P dV , is the 
Noether's integral that is conserved when the system 
is translationally invariant in space. Another corollary, 
which is obtained by comparing Eq. (45) with Eq. (25), 
is that 



f = ldt\a 



-lWv 



(46) 



where we also used Eq. (39). Alternatively, one can 
rewrite this as f = Afd t P, where dtP is the force on 
an individual photon. 

Angular momentum. — Taking X = xxk, one obtains 
from Eq. (42) that 

9 t M + V-(Aiv g )=Zd t (xxk), (47) 

where we formally introduced M. = (x x k)X, or 

M = xxV. (48) 

Based on Eq. (48), one could anticipate that M. is the 
wave angular momentum density, and indeed Eq. (47) 
yields that this is the case, as we will now prove. 

C. Angular momentum 

Conservation theorem. — Consider system rotation by 
an arbitrary infinitesimal angle Sip. Associated with this 
rotation will be a variation of the Lagrangian density 

5£ = £ k • 5k + £ x • 5x, (49) 

where we substituted Eq. (11) for £ a ; also, 

8k = Sip x k, <5x = Sip x x, (50) 



£k = 3 = — v g2^7 and £ x = Xd t k, where the latter is 
taken from Eq. (46). Hence, 

Z _1 (5£ = — v g • (Sip x k) + d t k ■ (Sip x x) 
= Sep ■ (v g x k) + Sip ■ (x x d t k) 

= 5ip-d t (xxk). (51) 

Having <5£ = yields that d t (x x k) = 0. From Eq. (47), 
one then obtains that 



d t M + V • (Mv g ) = 0, 



(52) 



which means, in particular, that J AA dV is conserved. 
Since this is the invariant associated with the medium 
isotropy, it by definition [129, Sec. 9] represents the wave 
angular momentum. Correspondingly, AA is the wave 
angular momentum density [130]. Also, M = AA/Af, or 



M = xx P. 



(53) 



is the angular momentum of a photon, ft dt (x x k) = dtM. 
is the torque on a photon (cf. Ref. [96]), and the corre- 
sponding dynamic equation is spelled out as 



d f M 



x x H„ 



(54) 



Spin angular momentum (SAM). - Consider a sta- 
tionary wave beam symmetric with respect to z axis; i.e., 
in cylindrical coordinates (r, <j>, z), the amplitude a and 
the wave vector components k r , k^, k z are independent 
of <j). The consistency relation (18) requires then that 
d r (rkj>) = 0, so kfj, — m/r, where to is a constant. This 
gives A4 Z = rk^I = ml, or that the carried angular 
momentum per photon is M z — mft To find to, notice 
that, due to k^ = r~ 1 d < p9, the wave canonical phase has 
the form 9 — mcj) — cot + S(r, z), where S is some func- 
tion of r and z only. Thus, after any time St, the wave 
must repeat itself, at the same r and z, in the coordinate 
frame rotated by S(f> = (ui/m) St. Satisfying this condi- 
tion are, in fact, only circularly polarized waves (at least, 
in free space), corresponding to m = ±1. Other types 
of wave beams therefore cannot be considered symmet- 
ric within GO and thus can be assigned only average to. 
Specifically, decomposing a wave with a given elliptic po- 
larization into the two independent circularly-polarized 
components with corresponding weights C+ and C_ , one 
gets (to) = C+ — C_ . In particular, linear polarization 
corresponds to C+ = C_, in which case (to) = 0. 

These results match the known quantum theorem, 
which says that states with circular polarization are 
the only polarization states of a free photon that are 
eigenstates of the corresponding SAM projection, M z = 
±fi [131, Sec. 8]. Thus, for an axially symmetric beam, 
.A/fz that originates entirely from the beam polarization 
can be called the SAM density. Interestingly, it can also 
be interpreted as follows. For those (circularly polarized) 
waves that do allow precise definition of the SAM, the 
latter appears due to the singularity of k^, at r = 0, i.e., 
due to 9(r = 0) being undefined [132]. In this sense, the 



canonical phase increment A9 = 2izm along a closed con- 
tour encircling the symmetry axis is the corresponding 
Berry phase [133, 134], so the photon SAM (in units ft) 
is nothing but the Berry index of the classical phase field. 
Finally, note that a wave beam that is not axially 
symmetric will also carry additional, "orbital" momen- 
tum [135, 136]. The latter is included in Eq. (48), and 
separating it from the SAM unambiguously may not 
be possible except in special cases, as usual; see, e.g., 
Ref. [135-137] or Ref. [131, Sec. 6]. 



D. Dissipation 

Suppose now that a linear wave experiences weak dis- 
sipation. Then, comprising the wave locally are Fourier 
harmonics with complex frequencies and wave vectors, 

n = n' + m", K = K' + iK". (55) 

Assuming the local dispersion relation in the form 

£>(fi,K)=0, (56) 

let us keep only the terms of the zeroth and first order in 
O" and K". Then one gets 



D + ilDoft" + iS K ■ K" = 0, 



(57) 



where 1) and its derivatives are henceforth evaluated at 
(fi', K'). Now suppose T> = 2)' + «£>", where £>" = Im!D 
is much smaller than T>' = RcS). One hereby obtains 



3D' + il)" + iD' n n" + i& K • K" = 



(58) 



(where higher-order terms were neglected), the real part 
and the imaginary part of which are, correspondingly, 



D'=0, 

s" + ®' n n" + Dk • K'' 



0. 



(59) 
(60) 



From here, the envelope dynamics is inferred as follows. 
At any given time, the field distribution of the real 
system can be mapped into the auxiliary nondissipative 
system, where the wave phase 9 is well defined, and 



£ = ID'(w,k)A 



(61) 



This defines the instantaneous a and also the instanta- 
neous real canonical frequency and wave vector, (w,k); 
hence all other local quantities can be introduced through 
£(a,w,k) too. However, the dynamics in the auxiliary 
system and in the real system are different; thus, for the 
latter, an extra term T must be added in Eq. (40), 



dt In I 



-v • v g - r. 



(62) 



Assume that dissipation is determined by the local 
(a,cj,k) and by the local parameters of the medium, 
rather than their gradients. Then one can find T by 



calculating it for homogeneous stationary medium and 
a wave whose field is locally "monochromatic", i.e., can 
be assigned particular complex (fl, K) [which map to the 
given canonical (w,k)]. Then, 



T = -dt In J = -x{Q" - v g • K"), 



(63) 



where x = dlnA/dhia (which commonly equals 2; sec 
Sec. IV A), and the left-hand side is evaluated at (O', K'). 
On the other hand, Eq. (60) yields 



ft" - v ff • K'' 



-3)"/S>n 



(64) 



Hence T is connected with the dispersion function as 

r(w,k) = xS)"(«,k)/2)^(fa;,k) ) (65) 

where we used that, to the leading order, it is sufficient 
to take (J7',K') s=s (w,k) on the right-hand side. 

Now let us present the corresponding transport equa- 
tions. Similarly to Eq. (42), one has, for any X, that 



d t (XI)+V -(XJ) =ld t X-TXI. 



(66) 



Since X is arbitrary, the number of equations that can 
be produced from here is infinite, like in Sec. IV B. In 
particular, those for the action, the energy, the momen- 
tum, and the angular momentum are obtained by taking 
X = l,X = oj,X = k, and X = x x k, correspondingly, 
and are as follows: 

d t 2 + V • (Iv g ) = -VI, (67) 

<9 4 £ + V-(£v g ) =Td t uj-T£, (68) 

d t T + V ■(Vv e )=ld t k-TT, (69) 

9 t M + V.(Mv g )=Id t (xxk)-rM. (70) 

The physical statement contained in these is twofold. 
First of all, one can see that the decay rate is the same 
in all the equations, regardless of the specific X. [This, of 
course, is seen already from Eq. (66).] Second of all, this 
rate is actually known from Eq. (65), which connects T 
with the dispersion function "D. In particular, the action 
loss per unit volume per unit time can be written as 



= Y1= x&'A, 



(71) 



and the corresponding losses of the wave energy, momen- 
tum, and angular momentum are given by 

Wloss = w *lossj Moss = "-iloss, Mloss = (X X Kjli oss . 

Also notice that dtui and c^k entering Eqs. (67)-(70) can 
be taken from the GO ray equations. Since based en- 
tirely on Eqs. (17) and (18) (Sec. IV A), those happen to 
be unaffected by dissipation; i.e., they are still given by 
Eqs. (39). Hence, the above results can be interpreted as 
follows: local dissipation does not affect individual pho- 
tons but rather changes the photon density. 

For an explanation of how the results reported here 
apply to electromagnetic waves, see Sec. VI. The same 
results are also applicable to dissipation-driven instabil- 
ities (r < 0). Nondissipative instabilities can be accom- 
modated within GO too, namely, by allowing for complex 
rays; for details see Rcf. [138] and references therein. 



V. LINEAR WAVES: ABRAHAM 
REPRESENTATION 

A. Basic definitions 

In addition to the wave canonical, or Minkowski EMT 
that we discussed so far, one can also introduce the cor- 
responding so-called kinetic, or Abraham EMT, 



r a/3 



£ -d/c 
Cp 7T 



(72) 



It is defined such that, being a part of the complete EMT 
that describes the "wave + medium" system (WMS), T a ^ 
comprises all the wave- related (i.e., a-dependent) dynam- 
ics of the medium and fields. We hence express it as 
T ap = j-a/3 + At"* 3 , where At Q/3 is the "ponderomotivc" 
part that is stored in the medium, and, similarly, 

e = £ + As, p = V + Ap, p = M + Ap. (73) 

In particular, notice the following. Since the WMS is 
closed and thus Lorentz-invariant, its complete EMT is 
symmetrizable [88, 117, 118]. Yet its unperturbed part 
is symmetrizable by itself (because it describes a closed 
system too, namely, the medium absent a wave), so t Q/9 is 
also symmetrizable separately. On the other hand, since 
T a P is proportional to the wave intensity, it is defined 
uniquely and, therefore, must be symmetric. This yields 
p = fl/c 2 , and 



p = x x p 



(74) 



holds automatically [71, Sec. 32]. Also, since the integral 
energy-momentum of the whole WMS is defined uniquely 
[71, Sec. 32], and its a-dependent part is defined uniquely 
too, one can find (e/c, p) as the a-dependent part of the 
WMS canonical energy-momentum density. Given the 
WMS Lagrangian density, the latter can, in principle, 
be found straightforwardly in any specific problem [116]. 
However, the general answer is not informative (mean- 
ing that T a P is by itself a somewhat artificial construct; 
see also Ref. [121]). Thus, below, we consider only the 
particular model of an isotropic medium, most popular 
in the AMC context, yet still refrain from specifying the 
wave nature. 



B. Wave energy- momentum in isotropic medium 

General case. — - Consider an isotropic medium (such 
as gas, fluid, or plasma) comprised of elementary [139] 
particles or fluid elements whose dynamics absent a wave 
is described by some aggregate Lagrangian L. In the 
presence of a wave, the WMS Lagrangian is hence L + L, 
where L = J £ dV is the wave Lagrangian. Assuming 
that particles contribute to L additively, the latter can be 
written as L = |J°) — 'J2 e <frw, where lJ ) is independent 



of all particle velocities u^ , and each of the so-called 
ponderomotive potentials $W [106, 107], or dipole po- 
tentials [67, 68], depends on the specific u^ but not on 
other velocities. Omitting the index £, we can write the 
canonical momentum of each particle as the sum of the 
mechanical part d u L and the ponderomotive part — d u &, 
also yielding the ponderomotive contribution to the par- 
ticle canonical energy, — u • d u Q. (This energy should 
not be confused with the ponderomotive potential <J> it- 
self, which is a part of the wave canonical energy [140].) 
Thus, the densities of the ponderomotive momentum and 
energy stored in particles can be written as follows: 



Ap = -^n (s) (5 u $) (s) , 

S 



(75) 
(76) 



where the summation is taken over different species, n^' 
are the (locally averaged) densities of those species, and 
angular brackets denote averaging over velocities within 
the corresponding ensembles. 

Fluid model. — If a medium can be modeled as a sin- 
gle fluid (in particular meaning that kinetic effects are 
inessential, unlike, e.g., in warm plasma), one can sim- 
plify Eqs. (76) and (75) further, namely, as follows. First 
of all, notice that the velocities u of fluid elements are all 
equal to a single velocity v, so Eqs. (75) and (76) become 



Ap = -n <9 V $, Ae = v • Ap. 



(77) 



It is hence convenient to rewrite Eqs. (77) in terms of 
Lorentz- invariant proper parameters of the medium [141]. 
Since <f> that enters here depends on the wave intensity, 
it must be gauge-invariant; thus, being (minus) the in- 
teraction Lagrangian of a single element, it transforms as 
$ = $'/7 [142], with primes in this section (Sec. V) de- 
noting the medium rest frame, and 7 = (1 — v 2 /c 2 ) -1 ' 2 . 
Also, n — 771/, where n' is the proper density, correspond- 
ingly. Since the latter does not depend on v, we then get 
Ap = —d v (n'<&') +7 2 vn'<I>'/c 2 . Further, let us denote 



n'fc' = £' - £'(°) ±14', 
where £'(°> is L'(°) per unit volume, and introduce 



9t 



9 

7 z v 



W, 



(78) 



(79) 



understood as the striction contribution (Sec. VI B). 
Since £'(°) is also independent of v, one then can write 



Ap = <9 V £' + 9\. 



(80) 



Due to the fact that a Lagrangian density is a four- 
scalar, £' that enters Eq. (80) can also be replaced with 
£ [143]. However, using £'(a',k') is preferable, because 
it cannot depend on v explicitly, but rather depends on 
it solely through a' and k' . [Remember that the velocity 



derivative in Eq. (80) must be taken at fixed a and fc M .] 
Due to £f a , = [cf. Eq. (30)], we then get 



d v £,' = -(d v A\)KJ'», 



(81) 



where we substituted the (covector) Lorentz transfor- 
mation (2), i.e., k' = h^ftkv On the other hand, 
k v = (A _1 ) A v k' x , so Eq. (81) can also be written as 

9 V £' = - 7 G\T A '7 C , (82) 

where we introduced a dimcnsionless matrix function 

G\(y)±(c/ 1 )(A- 1 )\(d v A%). (83) 

As shown in Appendix, Eq. (82) is also equivalent to 

9 V £' = 7 Tr(GT')/c = «£ + 58, (84) 

where the terms on the right-hand side are defined as 



S3 



*P = 7 A 
7 2 



-V 



7+1 



- x \-^xV 

c V c 



(85) 
(86) 



Yet, v' is parallel to k in isotropic medium, so 23 van- 
ishes, and we finally get 

Ap = «p + CH, Ae = v(<p + 9t). (87) 



C. Wave EMT in the isotropic-fluid model 

Within the isotropic-fluid model, one can hence explic- 
itly construct the complete kinetic EMT of a wave, 



r afi = a q ^ <y^, 



which is done as follows. 

Energy and momentum. — First of all, let us combine 
Eqs. (73) and (87) with Eq. (85) for «£, Eq. (79) for 9\, 
and £ = 10X and T = kX, as well as with 



X = 7 T'(1 + v • v' /c 



' /„2\ 



(89) 



where we employed the four- vector transformation prop- 
erties of J a . This yields 



£ = 7 2 £' + ^-(a. v :, 



2 2 

7 v 



U', (90) 



P = 



i£' 



a-v; + 7 v + -(vv;; 



^V. (91) 



[Entering the numerator in Eq. (91) is actually k, not k .] 
By taking v = here, we then get, in particular, 

e' = £', cf? = 0'/c = S'Vjc, (92) 

also using that r a ^ is symmetric in all reference frames. 
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Momentum flux density. — Since k is the only desig- 
nated direction in the medium rest frame, the (symmet- 
ric) momentum flux density 7r must be a linear superpo- 
sition of k k and 1 , or, equivalently, tv =^k v' +(1 , 
where ip and £ are some coefficients. Combining this 
with Eqs. (88) and (92) and plus with, e.g., Eq. (90) for 



— ^oo 



e = r 



one readily obtains ip = X' and C, —U'\ i.e., 



(93) 



*' = 5'kV g /w' + w'r. 

(In particular, if v g = 0, the term W acts as the pondero- 
motive pressure; cf. Rcf. [144].) Equation (91) then flows 
from Eq. (88) automatically; yet, Eq. (88) also gives 



oj'Ck' -v'£') 

c 2 |k'l 2 



7 2 vv 



u/ 2 

7W 



£' 



7 2 vv 



W. (94) 



EMT and ponderomotive forces. - The wave kinetic 
EMT in isotropic fluid is hereby summarized as 



i a p 



£ £v g /c \ 

\£v g /c £\cv g /oj +14 1 J 
in the medium rest frame and is transformed to other 



(95) 



frames via Eq. (88), as also spelled out in Eqs. (90), (91), 
and (94). In particular, if the flow velocity is negligible 
in a given frame, one can take A"^ rj d a , so 



ew£, p«£v g /c 2 , n w (x x v g )£/c 2 



(96) 



Finally, the ponderomotive four-force density f a that a 
wave imparts to a medium also can be calculated [115], 

r = -r^ ;/3 , (97) 

whence, substituting f a = (w/c,f), one obtains 

w = -d t e -c 2 \7 ■ p, f = -d t p - V -it. (98) 

(Here w has the meaning of the power density input into 
the medium, and f is the usual three- force density.) In 
particular, note that since r a/3 is expressed through quan- 
tities derived from £' and £'( ' , which are the fundamen- 
tal invariants of the wave, the usual ambiguity in calcu- 
lating the forces on the medium is hence avoided. 

The above results, which rely essentially only on 
Eq. (11) and the isotropic- fluid approximation (without 
any reference to clcctromagnetism) , represent a more 
concise and transparent version of those reported in 
Ref. [88] and generalize the latter to the case of waves 
of arbitrary nature; see also Sec. VI B. 



D. Photon kinetic properties 

The following energy, momentum, and angular mo- 
mentum can now be assigned to a single photon: 



Keep in mind, however, that these are merely quanti- 
ties per photon rather than the momenta of a photon, 
in contrast with (H, P,M) that actually enter the pho- 
ton motion equations [Eqs. (41) and (54)]. As a result, 
(h, p, m) do not enjoy the simple transformation prop- 
erties of their canonical counterparts. In particular, the 
kinetic four-momentum p a = (h/c, p) is generally not 
a four- vector. One can easily check this, e.g., by using 
p a = (cN)- 1 J r a0 dV with t q/3 taken from Sec. V C and 



AA = 7 AA'(l + v-v;/c 2 ), 
£' = hu'AP, V = Kk'Af'. 



(100) 
(101) 



Still, simple expressions are obtained from Eqs. (96) 
for isotropic fluid medium at rest; namely, 

h w fko, p w hwv g /c 2 , m w (x x v g )hw/c 2 . (102) 

Since here v g is assumed to be parallel to k, one also gets 
that p is parallel to P, m is parallel to M, and 



p/P = m/M « l/(n p n g ) 



(103) 



These match the traditional Abraham's formulas [1, 46], 
hence seen to hold for waves of arbitrary (not necessar- 
ily electromagnetic) nature. Yet it is clear now that the 
traditional formulas are, in fact, approximate and gener- 
ally invalid for moving and non-fluid media, in contrast 
with Minkowski's formulas for the canonical quantities 
[Eqs. (33) and (48)], which are more universal. 



VI. LINEAR ELECTROMAGNETIC WAVES 

Finally, let us apply the above results to illustrate how 
the properties of linear electromagnetic waves can be cal- 
culated explicitly within our general approach, without 
using Maxwell's equations for the wave envelope. Note 
also that similar calculations can be performed for non- 
linear waves too, for which £ can be constructed from 
first principles as well [106-109]. 



A. Wave Lagrangian 

Basic equations. - First, let us consider a nondis- 
sipative wave, as usual. The wave Lagrangian density 
(derived independently, e.g., in Refs. [11, 88]) can be ex- 
pected in the form £ = £(°) — U, where 

£(°) = — (E* • E - B* • B) (104) 

16-7T 

is that in vacuum [106], E and B are the electric and 
magnetic field envelopes, and U is the potential energy 
density of the wave-medium interaction [cf. Eq. (78)]. 
For linear, i.e., dipolar interaction, we can take [145, 
Sees. 4.2, 4.8, 5.7, 6.2] 



h = e/Af, p = p/Af, m = fi/Af. 



(99) 



U = --Rc 

4 



E* P 



B M 



(105) 



11 



Here P is the electric dipole moment density (i.e., the po- 
larization), and M is the magnetic dipole moment den- 
sity (i.e., the magnetization); also, one factor 1/2 comes 
from the time-averaging, and the other 1/2 comes from 
the fact that P and M are linear functions of E and B, 
correspondingly. Now let us introduce D and H via 



D = E + 4ttP = e • E, 
B = H + 4ttM = ft • H, 



(106) 

(107) 



assuming that the permittivity tensor e and the perme- 
ability tensor jj, (not to be confused with the kinetic angu- 
lar momentum density fi) are Hermitian so the assump- 
tion of zero dissipation be satisfied. One gets then [11, 88] 



£ = 



16tt 



E* e E-B* ■ (T l B 



(108) 



(here fi is the tensor inverse to [i), also meaning that 



U 



1 
16^ 



E ■(€-!)• E-B -(jj,- 1 -i)-B 



(109) 



In agreement with Rcfs. [106, 107], this implies assigning 
the following ponderomotive potentials to particles (or 
fluid elements) comprising the medium: 



$ 



-E -d-E/4-B -/3-B/4, 



(110) 



where a and (3 are the particle electric and magnetic 
polarizabilitics [68], and 



S 

/T 1 = i-^47rn (s) (/3) (s) . 



(Ill) 
(112) 



one can further rewrite Eq. (114) as follows: 



£ = 



E 

16^ 



e E 



kx 



(i l • (k x E) 



(116) 



Then, varying £ with respect to E yields the following 
dispersion relation: 



e E 



kx 



fi r 1 • (k x E) 



= 0, (117) 



in agreement with Maxwell's equations [146, Sec. 3.4]. 
Similarly, varying £ with respect to E yields the complex- 
conjugate equation. 

Alternatively, if the polarization vector e is prescribed 
(or considered as an independent field), one can as well 
introduce a scalar amplitude instead, say, a = |E|. This 
yields £ — S(u;, k)a 2 , with 1)(u), k) given by 



D = 



1 

167T 



e* • e • e ^(kx e*) • fi 1 • (k x e) 



(118) 



The dispersion relation that follows [Eq. (30)] is Eq. (117) 
multiplied by e*. 



B. Wave action and EMT 

Action. - The action density T is now obtained 
straightforwardly by differentiating £ [e.g., Eq. (114)] 
with respect to u>: 



1 

1&7T 



2 _-, 



E -e w -E+-H B-B • (/i^ 1 ^ • B 



Parametrization and dispersion. - Remember that 
there is a freedom in defining a, so there are various op- 
tions for how to parametrize the wave Lagrangian den- 
sity. First, let us consider E and E as independent vec- 
tor fields; i.e., a — (E, E ). In this case, it is convenient 
to write 



£ (o) = ^(E*.E-^|kxE| : 

16-7T 

(where we used that B = ck x E/w) and 

„2 



(113) 



1 
16^ 



E 



e • E - — (k x E (•P'(kxE) 



(114) 



correspondingly. Using that 
(kx E*) • jT 1 • (kx E) = 



(E x k) • fi r 1 • (k x E) = 

- E* • {k x [jj,- 1 ■ (k x E)]}, (115) 



where we used H B = B H, due to (i being Hermitian. 
From £ = [Eq. (30)], one also has 







E D = H B. 




(119) 


Thus, 1 = 1^ + 2 (B \ where 




j(E) _ J_ 
167T 


E* • e u ■ E + - E* • e • E 


1 


(120) 


16tt 


'1 
u 


H* • A • H - B* • (fi-% 


B 


• (121) 



One can further substitute 

B*-(/rVB = B* -(/rV/i-H = 

= -B* ■ /T 1 ■ £„ ■ ft = -H* ■ £„ ■ H, (122) 

where we used (£ _1 • jjl) u = 0. Therefore, 
1 



16iruj 



E -(we^-E + H -(w/i^.H . (123) 
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Canonical EMT. — - The elements of the wave canon- 
ical EMT [Eq. (26)] are readily obtained from Eq. (123). 
For completeness, we summarize them here once again: 



£ = u)X, Q = v K u)T, V = kl, 



n = kv g I. (124) 



Kinetic EMT. — Assuming that dissipation is negligi- 
ble and the medium is isotropic, the wave kinetic EMT, 
as well as the kinetic angular momentum, can also be 
found, namely, using the results from Sec. V. In the gen- 
eral case, one can employ Eqs. (75) and (76), substitut- 
ing Eq. (110) for $. In the isotropic-fiuid approximation, 
Eqs. (90), (91), and (94) can be used in combination with 
Eqs. (123) and (124) taken in the medium rest frame. 
Due to Eq. (109), one can also take, in particular, 



91 



2 

7 z v 



de |E| 2 
dn 167r 



dfi- 1 |B| 



dn 167r 



(125) 



where the expression in parenthesis (equal to the 
interaction-Lagrangian density —U) is Lorentz-invariant. 
Hence 9t can be attributed to electrostriction and mag- 
netostriction [88, 147]. Besides, one can show that 
Eqs. (170) and (171) of Rcf. [88], derived there from dif- 
ferent considerations, are recovered from our Eqs. (90), 
(91), and (94) as a special case. (The proof is straight- 
forward and will not be presented here.) For practical 
ramifications of our results in application to electromag- 
netic waves also see Ref. [88] and Sec. VII. 



According to Sec. IV D, we can infer X directly from 
Eq. (123) by replacing 33 with 33', so 



1 = 



1 



167ru; 



E* • (we% • E + H* • (w£% • H . (128) 



Then the known formula [72, Sec. 80] for the energy den- 
sity is recovered from £ = ujX. Other local properties of 
the wave are found from Eqs. (32) and (48), the dissipa- 
tion rate T is found from Eq. (65), and Eq. (71) yields 

i loss = i- (e* • e" ■ E + H* • A" • ft) , (129) 

where we substituted k = 2, since A = a 2 . The ex- 
pression for the dissipation power density, wi oss = u)i\ oss , 
hence also agrees with the known formula [72, Sec. 80]. 



D. Dielectric media 

Since B is proportional to E, one usually can define the 
high-frequency medium-response tensors e and /t such 
that ji = 1 (in a selected frame of reference). As this 
is done often, e.g., in plasma physics [146], let us also 
simplify some of the above expressions for this particular 
case. First of all, Eq. (108) yields 



1 

167T 



E • e ■ E 



IkxEl 2 



(130) 



C. Dissipative waves 

In the presence of dissipation, the dispersion relation 
flowing from Maxwell's equations is similar to that in 
Sec. VIA. Namely, it can be written as 25(0, K) = 0, 
where 33 has the same form as in Eq. (118), yet now with 



n = n 



Hi . 



(126) 



where e and fi are Hermitian, and ie and ifi are anti- 
Hermitian. Using that 



{ix' + ift") 1 



M 



ifj, ■ fi ■ fj, , 



(127) 



we can hence write, for 5) evaluated at real (w,k), that 
33 = 33' + i33", where 33' and 33" are real and given by 



& 



1 

167T 



(kxe*) -(l'- 1 .(kx e) 



33" = 



167T 



w 



(k x e*) • (i 



i-i 



■ jx" ■ (i 1 - 1 -(kxe) 



or £ = £(°) + E* • x ■ E/(16tt), where fi(°) is the vacuum 
Lagrangian [Eq. (113)], and we introduced the electric 
susceptibility x = e — 1. Then the wave energy is 



1 
16tt 



E -(wfeVE- 



(131) 



or, equivalents [due to E • e ■ E = |B| 2 ; cf. Eq. (119)] 



£ = 



1 



167ru; 



E -( W 2 e') w -E. 



(132) 



Also, as usual, the canonical momentum density equals 



V = k£/u. 



(133) 



One can show, using Eq. (117), that the latter is just 
a more concise form of the corresponding expression in 
Ref. [148]. Contrary to Ref. [149], calculated there is thus 
not the total, but only the canonical momentum (and the 
canonical energy) of the wave; see also Refs. [150, 151]. 

Following Ref. [146], let us also separate the energy 
flux Q into the electromagnetic part and the kinetic part. 
Specifically, using Q — — cj£k, one can write it as Q — 
S + 7C, where S = — u)S^ , and 



K =~ l!^*-*^ 



(134) 
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The latter is recognized as the energy flux density caused 
by the presence of the medium [146, Chap. 4], whereas 



|(k x E*) • (k x E) + (k x E*) • (k x E)| 



yields the dispersion relation in the form w' : 



w; 



and 



Wirui 



Wttuj 



|k • [E* x (k x E)] + k • [E x (k x E*)] | 
E* x (k x E) + E x (k x E*)| 



lQ'KUl I 



— Re (E x B 

8tt v 



(135) 



is the time-averaged Poynting vector, i.e., the "vacuum 
part" of Q. [Here we substituted Eq. (113) and used un- 
derlining to specify where the differentiation applies.] Re- 
calling that Q = £v g , one then also recovers the known 
formula [146, Chap. 4] 



(S + K)/£. 



(136) 



Below, several examples of specific dielectrics will be dis- 
cussed to illustrate these and earlier formulas. 



E. Examples 

Waves in fluids at rest. - Let us first summarize, 
using the results of Sec. VI D, the densities of the kinetic 
energy, momentum, and angular momentum for a wave 
in a fluid dielectric at rest: 

e = £, p={S + K)/c 2 , fi=[xx (S + K)]/c 2 . 

Electromagnetic waves in vacuum can be considered as 
a special case and have uj 2 = c 2 k 2 , so v g = c 2 \lJuj = ck, 
where k = k/k. Then £ = |£| 2 /(87r); c 2 V and Q are 
equal to each other (so the EMT is symmetric, and 
canonical quantities coincide with kinetic quantities) and 
<S = kc£; also, II = kk£ equals minus the time-averaged 
Maxwell stress tensor. Thus, in this case, the wave EMT 
coincides with the electromagnetic stress-energy tensor 
[71, Sec. 32]. Besides, M. and fi are both equal to 
x x S/c 2 , in agreement with the traditional definition 
of the wave angular momentum density in vacuum [135]. 
Electrostatic waves in beams and plasmas. — To also 
illustrate waves in moving dielectrics, consider further a 
relativistic electron beam with electrostatic oscillations 
seeded on it with k parallel to the beam velocity v. As- 
suming that the beam is cold, it acts as a fluid medium 
isotropic in its rest frame. (In fact, as long as the dy- 
namics is one-dimensional, having isotropy is inessen- 
tial.) Since there is no dissipation in a cold beam, we 
revert here to using primes as a reference to that frame. 
The proper ponderomotive potential, same for all elec- 
trons, can then be written as $' = e 2 \E\ 2 / {Am c uj' 2 ) [107], 
where e and m c are the electron charge and mass; re- 
member also that the longitudinal field satisfies E = E' . 
Then e' = 1 - lj' 2 /lu' 2 (cf. Sec. VIA), where u' p = 
(4:7rn' e 2 / rric) 1 ' 2 , and n' c is the beam proper density. This 



= in particular. From Eq. (132) we hence get 



£' = \E\ 2 /(8tt),soU' 
and (91) lead to 

e = (3 7 2 - 1) 



£'/2 [cf. Eq. (78)]; then Eqs. (90) 



\El 
16tT 



3 7 2 v \E\ 



16tt 



(137) 



Since u/ = j(w — kv), the dispersion relation in the lab- 
oratory frame is uj = uj' p /"f + kv, and v g = v; therefore, 
p = [1 - l/(37 2 )]" 1 ew g /c 2 , or p w 3ev g /(2c 2 ) at 7 w 1. 
This differs by a factor 3/2 from Abraham's result (ex- 
pected at nonrelativistic velocities), but the discrepancy 
can be readily explained. Recall that Abraham's limit 
[Eq. (102)] is derived by neglecting v but retaining v g . 
For the electrostatic waves in question, this is legitimate 
only when v equals zero exactly; in that case, just like 
Abraham's formula, Eq. (137) yields p = 0. Note that 
this result is also understood from the fact that, at v = 0, 
each nonrelativistic electron has a zero average momen- 
tum, while the instantaneous momentum of the electro- 
static field (proportional to the instantaneous Poynting 
vector [145, Sec. 6.9]) is zero identically. 

Now consider thermal effects, which will render beam 
oscillations identical to Langmuir waves in warm col- 
lisionless plasma [146, Chap. 8]. We can take the 
electron ponderomotive potential in the form $ = 
e 2 \E\ 2 /[Am c (<jj — ku) 2 ], where u is the average veloc- 
ity of an individual particle; then the general Bohm- 
Gross dispersion relation is recovered [107]. As usual, 
we assume that the electron thermal velocity i>t satisfies 
£0 = u/(kvr) 3> 1, and the electron average velocity is 
zero, so one gets uj 2 w uj 2 + ik 2 v\ and v g w ikv^/u. 
(See Ref. [152] for how to deal with the singularity at 
u = oj/k.) The canonical energy and momentum den- 
sities are then found to be £ s=s (w p /u;) 2 |_E| 2 /(87r) and 
V = k£/uj. This time we yet cannot apply Abraham's 
formula [Eqs. (102)] to further find e and p, because, as 
an ensemble of electrons with different velocities, warm 
plasma is not a simple fluid. Thus we revert to Eqs. (75) 
and (76), which give 



Aew — 



Hn 



A,-* 4 !f. (138, 

UJ UJ A 87T 



Hence, Eqs. (73) finally yield e = |£| 2 /(8tt) and p = 0, 
at least up to terms of the order of £q~ 4 . 

Since p is independent of the amplitude, the evolution 
of Langmuir waves does not affect the electron average 
momentum (zero in our example). This agrees with the 
standard quasilinear theory [153]; however, an explana- 
tion is due regarding the details. Notice that a Langmuir 
envelope seems to transport canonical momentum due 
to nonzero v s , but does not seem to transport any ki- 
netic momentum, as p is zero; on the other hand, one 
may expect that the total momentum in any volume en- 
closing the pulse is defined unambiguously in quiescent 
plasma, so one arrives to paradox, much like the AMC. 
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To resolve this, recall that above we assumed zero aver- 
age velocity inside the wave, so the outside plasma had 
to be flowing, as the amplitude gradient at the inter- 
face causes pondcromotive acceleration. Accordingly, to 
keep the outside velocity zero, we must allow for a non- 
vanishing flow of electrons inside the pulse, and that is 
precisely where the total momentum is stored. In other 
words, Langmuir waves do transport momentum through 
ambient plasma, just like other waves. This, if anything, 
may serve as a particularly vivid illustration for how the 
abstract resolution of the AMC relates to experiment. 

Kinetic waves. - The small but finite Ae/£ ~ £q~ 
that we found for Langmuir waves is a purely kinetic 
effect, which does not fit into the resting-fluid model, 
Eq. (92). For other waves in collisionless plasma, hav- 
ing finite £o generally renders the fluid model inappli- 
cable too [146]. (Same applies to collisionless gas [154], 
but collisional media are more forgiving.) Light waves in 
particular are affected at relativistic temperatures, which 
are possible in astrophysical settings and also in labora- 
tory, say, in counterpropagating relativistic beams. But 
even more easily the fluid approximation can be broken 
for waves tuned in resonance with medium natural oscil- 
lations. Replacing £c>) the controlling parameters in this 
case are each of & = (ui — Kl)/(kvT), where il is the 
natural frequency, and I is an arbitrary integer. This is 
well-known for a cyclotron resonance [146], but interac- 
tions at other, even quantum resonances are similar [67]. 
For example, one can show that Ae/£ ~ £j~ when a 
medium, resting on average, consists of two counterprop- 
agating low-density beams of nonrelativistic oscillators 
with u> ~ fi. However, elaborating on this is beyond the 
scope of our paper, and our intention here is only to rc- 
emphasize that Abraham's formulas for e and p may not 
apply even when the medium average velocity is zero. 



F. Quantum interpretation 

Let us now recast our findings in the photon language. 
To do so, however, we first need to recall what a photon 
actually is, and we start with one in vacuum. 

A vacuum photon is defined as an elementary excita- 
tion of electromagnetic field, in a certain energy eigen- 
state that determines both the spatial structure of the 
mode and also its frequency. For example, the mode can 
be a standing wave with a certain number of nodes in 
a finite-size box. Yet if the box size is large enough or 
infinite (as it would be for free field), the energy gap be- 
tween neighboring modes is negligible, so we can loosen 
the definition. Specifically, neighboring modes can then 
be excited coherently such that their interference pro- 
duces a propagating envelope with a size negligible for 
our purposes. Precisely these envelopes we call photons, 
much like it is often done for regular particles in quasi- 
classical theories [155]. Hence the GO approximation is 
also adopted automatically. 

Now that we have introduced a photon in vacuum, let 



us define one in a dispersive medium. It is natural to 
continue thinking of a photon as a quasiparticle in this 
case, i.e., an object that is conserved within the GO ap- 
proximation. To understand what it means, consider a 
vacuum photon entering a dispersive medium. We will 
assume no scattering at the boundary, to ensure that GO 
remains valid; i.e., (i) there is no reflection, and (ii) the 
wave continues inside the medium as a quasimonochro- 
matic field. In other words, only one of the internal 
eigenwaves, or branches of the dispersion relation, is ex- 
cited, while others are nonresonant to the incident pho- 
ton and thus remain quiescent. In the general case, when 
the medium is both inhomogeneous and time-dependent, 
there is exactly one GO integral: the wave action. There- 
fore, it is the action conservation that must be associated 
with the photon conservation. And since all the action 
belongs to a single branch, so must all photons. 

As one moves away from the boundary, the initial 
branch can adiabatically transform into something very 
different from the original electromagnetic wave. In that 
case it may be more natural to assign a different name 
to its elementary excitations, i.e., call them not photons 
but, say, plasmons or polaritons. Remember, however, 
that a wave with given u> and k can be associated with 
only one type of quasiparticles, whatever they are called, 
and those must account for oscillations of both the field 
and the medium. (In other words, GO photons cannot 
be separated from GO polaritons in principle, as long 
as a single branch of the dispersion curve is considered.) 
Absent a better term [156], we thus adopt "photon" as 
the generic term for all such elementary excitations. 

Much like an electron interacting with a vector poten- 
tial, a photon defined this way can hence be assigned 
two different momenta. The first, canonical momentum 
P is the one that enters Hamilton's equations [Eqs. (41)] 
and is due to the corresponding Noether symmetry of 
the wave subsystem. It is given by P — Kk; surprisingly, 
this expression holds in any dissipationless medium what- 
soever and even for nonlinear waves. (Not so for the 
canonical energy flu).) The second, kinetic momentum 
p is defined as the amplitude-dependent part of the av- 
erage momentum of the whole physical system divided 
by the total number of photons. It is due to the WMS 
Lorentz invariance, or, in other words, the corresponding 
Noether symmetry of the whole system, which includes 
the medium. In particular, for isotropic fluid medium 
at rest we show that p = fiwv g /c 2 , in agreement with 
Abraham [64]. Remember, however, that this result is 
less general than Minkowski's formula for P; for exam- 
ple, it does not apply in moving media (such as beams) 
and may not hold at relativistic temperatures. 

Note, finally, that the per-photon average momenta of 
fields and particles taken separately are not associated 
with any conservation laws, in contrast with P and p. 
Therefore they cannot be attributed to any conserved 
quasiparticles and, in this sense, are less meaningful. 
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VII. DISCUSSION 

In this paper, we pose classical GO axiomatically 
within the field-theoretical approach, while extending it 
to account for dissipation. The concept of a photon in a 
dielectric medium is introduced, and photon properties 
are calculated unambiguously. In particular, the canon- 
ical and kinetic momenta and angular momenta carried 
by a photon, as well as the two corresponding EMTs, 
are derived from first principles of Lagrangian mechanics. 
Responding to the questions posed in Ref. [35], we thus 
resolve the Abraham-Minkowski controversy pertaining 
to the definition of the photon energy-momentum and 
spin, clarify the applicability of Minkowski's and Abra- 
ham's formulas, and find corrections to them for various 
media (including cold, warm, and relativistic media). 

Furthermore, the axiomatic formalism that we adopt 
leads also to other new results, since it applies not just 
to electromagnetic waves but to any linear waves in any 
dispersive media, including ones yet to be discovered. 
For example, the EMTs of acoustic waves follow, plus 
the phonon spin - - and these are only some of the 
AMC-related issues that otherwise remain under debate 
[157-159]. We show, in fact, that all wave mechanical 
properties flow from little more than the wave defini- 
tion, whereas the specific internal physics of a medium is 
largely irrelevant. Since definitions are not really a mat- 
ter of verification, experimental resolution of the AMC 
is obviated; at least, one may call into question exactly 
what aspect of the AMC an experiment might resolve. In 
addition, derivations of the EMTs and angular momenta 
for specific waves are obviated too, as they are subsumed 
under our more general theory. For instance, (i) the 
canonical EMT that we present is general (Sec. IV), 
(ii) the force produced by a wave on a fluid-like medium 
is independent of the precise constitution of the medium 
(Sec. VC), and (iii) even understanding of kinetic waves 
does not require solving any field equations (Sec. VIE). 

Let us emphasize, finally, that these findings have been 
made possible by our adopting the abstract Lagrangian 
formulation, whose utility for understanding general lin- 
ear waves should hence be obvious. We also suggest 
Refs. [66, 144, 160, 161] as recent illustrations of how ad- 
vantageous it is for analyzing the linear wave dynamics 
in plasmas. In addition, the nonlinear Lagrangian the- 
ory has been getting a new spin recently, namely, in the 
context of plasma waves carrying autoresonantly trapped 
particles [106-109, 162]. Those waves are unique in the 
sense that the trapped-particle nonlinearity is, within a 
certain range of parameters, independent of the wave am- 
plitude or even gets strengthened when the amplitude 
decreases. Hence the traditional intuition and standard 
perturbative approaches often fail when applied to such 
waves, whereas the axiomatic GO not only holds but also 
offers the advantage of tractability [109]. In particular, 
understanding how the wave momentum relates to the 
plasma Lagrangian yields precise quantitative predictions 
of (quite nontrivial) evolution of waves with trapped par- 



ticles in nonstationary plasma [163]. Thus, with this pa- 
per, we would like to attract attention to the axiomatic 
GO itself as a remarkably convenient framework for ana- 
lyzing the wave basic physics, not only in the AMC con- 
text but also in the context of solving practical problems. 
The work was supported by the NNSA SSAA Pro- 
gram through DOE Research Grant No. DE274-FG52- 
08NA28553 and by the U.S. DOE through Contract No. 
DE-AC02-09CH11466. 



Appendix A: Auxiliary function G M 

Here, we summarize the properties of a dimensionless 
matrix function G ^ introduced in Eq. (83). First of all, 
notice an obvious equality 

(A- 1 )\(v) = A\(-v), (Al) 

which can also be checked by confirming that 

A" A (-v)A A „(v) = SS. (A2) 

Then a direct calculation yields 

G° ol = 0, (A3) 

G l o/=A J ; , GPu=rmVu (A4) 

G i J -« = (^t; i -^i; i )(7/c)/(7+l), (A5) 

where we introduced the notation 

G\ t = (G\), = (c/7)(A- 1 )\ (dA%/dv l ). (A6) 

(In particular, notice that the three Z-components, G v M j, 
at v = happen to be the well-known Lorentz boost 
generators.) Let us now define the function 

G v ^i = g v \G fj,i. (A7) 

Due to Eqs. (16), one finds the latter to be 

Gaoi = 0, (A8) 

Gou = -G m = -THjA j i, (A9) 

Giji = {riuvj - 7?j«Vi)(7/c)/(7 + 1), (A10) 

so, in particular, 

Gi/(i = — G^„, (G^„); = G^ v i. (All) 

Hence Eq. (82) becomes 

9 V £' = -jG\g x ,T'^/c = - 19 „ x G\r^/c 

= -jG^T'^/c = jG^T'^/c, (A12) 

which is exactly Eq. (84), where we substituted Eq. (A8) 
and introduced 

% = (7/c)(G M ir 0i + G 0il T' i0 ), <B, = ('y/c)G ijl T' ji . 



Finally, due to Eqs. (A9) and (A10), 



<$i = y( Vij A j i£'v^/c 2 - thjAPiV- 



jA^iS'v'Jc 



n 



[ 7 A • (£'< /c 2 V% 
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[( 7 +l)c 2 /7 2 ]^ 

= (ViiVj - r)nVi)V' 3 v'l = (v'giVjV 3 - V[viv'l) 
= [vUy.T')--P'(v.y')] l = [vx(v'xV')}i 



whence Eqs. (85) and (86) readily follow. 
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